SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Somerville, Chris 

Broun, Pierre 
van de Loo, Frank 

(ii) TITLE OF INVENTION: Production of Hydroxylated Fatty Acids in Genetically Modified Plants

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 



(A) ADDRESSEE: CUSHMAN DARBY & CUSHMAN, LLP

(B) STREET: 1100 NEW YORK AVENUE, N.W.

(C) CITY: WASHINGTON

(D) STATE: D.C.

(E) COUNTRY: USA

(F) ZIP: 20005-3918





(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.44 MB storage

(B) COMPUTER: IBM PC 

(C) OPERATING SYSTEM: DOS 5.0

(D) SOFTWARE: Word Perfect 5.1 

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: unassigned 

(B) FILING DATE: February 5, 1996 

(C) CLASSIFICATION: 

(vii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Paul N. Kokulis 

(B) REGISTRATION NUMBER: 16,773 

(C) REFERENCE/DOCKET NUMBER: 1220/216442

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 861 3000 

(B) TELEFAX: 202 822 0944 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 543 nucleotides

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TATTGGCACC GGCGGCACCA TTCCAACAAT GGATCCCTAG AAAAAGATGA AGTCTTTGTC 60

CCACCTAAGA AAGCTGCAGT CANATGGTAT GTCAAATACC TCAACAACCC TCTTGGACGC 120

ATTCTGGTGT TAACAGTTCA GTTTATCCTC GGGTGGCCTT TGTATCTAGC CTTTAATGTA 180

TCAGGTAGAC CTTATGATGG TTTCGCTTCA CATTTCTTCC CTCATGCACC TATCTTTAAG 240

GACCGTGAAC GTCTCCAGAT ATACATCTCA GATGCTGGTA TTCTAGCTGT CTGTTATGGT 300

CTTTACCGTT ACGCTGCTTC ACAAGGATTG ACTGCTATGA TCTGCGTCTA CGGAGTACCG 360

CTTTTGATAG TGAACTTTTT CCTTGTCTTG GTCACTTTCT TGCAGCACAC TCATCCTTCA 420

TTACCTCACT ATGATTCAAC CGAGTGGGAA TGGATTAGAG GAGCTTTGGT TACGGTAGAC 480

AGAGACTATG GAATCTTGAA CAAGGTGTTT CACAACATAA CAGACACCCA CGTAGCACAC 540

CAC 543

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 544 nucleotides

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

TATAGGCACC GGAGGCACCA TTCCAACACA GGATCCCTCG AAAGAGATGA AGTATTTGTC 60

CCAAAGCAGA AATCCGCAAT CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 120

ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT TGTACTTAGC CTTCAACG'Pr 18 0 

TCTGGCAGAC CCTACAATGG TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 240

GACCGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA TTCTAGCCGT CTGTTATGGT 300

CTTTACCGTT ACGCTGTTGC ACAAGGACTA GCCTCAATGA TCTGTCTAAA CGGAGTTCCG 360

CTTCTGATAG TTAACTTTTT CCTCGTCTTG ATCACTTACT TACAACACAC TCACCCTGCG 420

TTGCCTCACT ATGATTCATC AGAGTGGGAT TGGCTTAGAG GAGCTTTAGC TACTGTAGAC 480

AGAGACTATG GAATCTTGAA CAAGGTGTTC CATAACATCA CAGACACCCA CGTCGCACAC 540

CACT 544



(2) INFORMATION FOR SEQ ID NO : 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 0 

(B) TYPE: nucleotide

(C) STRANDEDNESS : single 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGAAGCTTT ATAAGAAGTT AGTTTTCTCT GGTGACAGAG AAATTNTGTC AATTGGTAGT 60

GACAGTTGAA GCAACAGGAA CAACAAGGAT GGTTGGTGNT GATGCTGATG TGGTGATGTG 120

TTATTCATCA AATACTAAAT ACTACATTAC TTGTTGCTGC CTACTTCTCC TATTTCCTCC 180

GCCACCCATT TTGGACCCAC GANCCTTCCA TTTAAACCCT CTCTCGTGCT ATTCACCAGA 240

AGAGAAGCCA AGAGAGAGAG AGAGAGAATG TTCTGAGGAT CATTGTCTTC TTCATCGTTA 300

TTAACGTAAG TTTTTTTTGA CCACTCATAT CTAAAATCTA GTACATGCAA TAGATTAATG 360

ACTGTTCCTT CTTTTGATAT TTTCAGCTTC TTGAATTCAA GATGGGTGCT GGTGGAAGAA 420

TAATGGTTAC CCCCTCTTCC AAGAAATCAG AAACTGAAGC CCTAAAACGT GGACCATGTG 480

AGAAACCACC ATTCACTGTT AAAGATCTGA AGAAAGCAAT CCCACAGCAT TGTTTCAAGC 540

GCTCTATCCC TCGTTCTTTC TCCTACCTTC TCACAGATAT CACTTTAGTT TCTTGCTTCT 600

ACTACGTTGC CACAAATTAC TTCTCTCTTC TTCCTCAGCC TCTCTCTACT TACCTAGCTT 660

GGCCTCTCTA TTGGGTATGT CAAGGCTGTG TCTTAACCGG TATCTGGGTC ATTGGCCATG 720

AATGTGGTCA CCATGCATTC AGTGACTATC AATGGGTAGA TGACACTGTT GGTTTTATCT 780

TCCATTCCTT CCTTCTCGTC CCTTACTTCT CCTGGAAATA CAGTCATCGT CGTCACCATT 840

CCAACAATGG ATCTCTCGAG AAAGATGAAG TCTTTGTCCC ACCGAAGAAA GCTGCAGTCA 900

AATGGTATGT TAAATACCTC AACAACCCTC TTGGACGCAT TCTGGTGTTA ACAGTTCAGT 960

TTATCCTCGG GTGGCCTTTG TATCTAGCCT TTAATGTATC AGGTAGACCT TATGATGGTT 1020

TCGCTTCACA TTTCTTCCCT CATGCACCTA TCTTTAAAGA CCGAGAACGC CTCCAGATAT 1080

ACATCTCAGA TGCTGGTATT CTAGCTGTCT GTTATGGTCT TTACCGTTAC GCTGCTTCAC 1140

AAGGATTGAC TGCTATGATC TGCGTCTATG GAGTACCGCT TTTGATAGTG AACTTTTTCC 1200

TTGTCTTGGT AACTTTCTTG CAGCACACTC ATCCTTCGTT ACCTCATTAT GATTCAACCG 1260

AGTGGGAATG GATTAGAGGA GCTTTGGTTA CGGTAGACAG AGACTATGGA ATATTGAACA 1320

AGGTGTTCCA TAACATAACA GACACACATG TGGCTCATCA TCTCTTTGCA ACTATACCGC 1380

ATTATAACGC AATGGAAGCT ACAGAGGCGA TAAAGCCAAT ACTTGGTGAT TACTACCACT 1440

TCGATGGAAC ACCGTGGTAT GTGGCCATGT ATAGGGAAGC AAAGGAGTGT CTCTATGTAG 1500

AACCGGATAC GGAACGTGGG AAGAAAGGTG TCTACTATTA CAACAATAAG TTATGAGGCT 1560

GATAGGGCGA GAGAAGTGCA ATTATCAATC TTCATTTCCA TGTTTTAGGT GTCTTGTTTA 1620

AGAAGCTATG CTTTGTTTCA ATAATCTCAG AGTCCATNTA GTTGTGTTCT GGTGCATTTT 1680

GCCTAGTTAT GTGGTGTCGG AAGTTAGTGT TCAAACTGCT TCCTGCTGTG CTGCCCAGTG 1740

AAGAACAAGT TTACGTGTTT AAAATACTCG GAACGAATTG ACCACAANAT ATCCAAAACC 1800

^=§GCTATCCGA ATTCCATATC CGAAAACCGG ATATCCAAAT TTCCAGAGTA CTTAG 18 55 



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 384 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



10
Met Gly Ala Gly Gly Arg Ile Met Val Thr Pro Ser Ser Lys Lys Ser

20 30
Glu Thr Glu Ala Leu Lys Arg Gly Pro Cys Glu Lys Pro Pro Phe Thr

40
Val Lys Asp Leu Lys Lys Ala Ile Pro Gln His Cys Phe Lys Arg Ser

50 60
Ile Pro Arg Ser Phe Ser Tyr Leu Leu Thr Asp Ile Thr Leu Val Ser

70 80
Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gln Pro

90
Leu Ser Thr Tyr Leu Ala Trp Pro Leu Tyr Trp Val Cys Gln Gly Cys

100 110
Val Leu Thr Gly Ile Trp Val Ile Gly His Glu Cys Gly His His Ala

120
Phe Ser Asp Tyr Gln Trp Val Asp Asp Thr Val Gly Phe Ile Phe His

130 140
Ser Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg

150 160
His His Ser Asn Asn Gly Ser Leu Glu Lys Asp Glu Val Phe Val Pro

170
Pro Lys Lys Ala Ala Val Lys Trp Tyr Val Lys Tyr Leu Asn Asn Pro

180 190
Leu Gly Arg Ile Leu Val Leu Thr Val Gln Phe Ile Leu Gly Trp Pro

200
Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala

210 220
Ser His Phe Phe Pro His Ala Pro Ile Phe Lys Asp Arg Glu Arg Leu

230 240
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu

250
Tyr Arg Tyr Ala Ala Ser Gln Gly Leu Thr Ala Met Ile Cys Val Tyr

260 270
Gly Val Pro Leu Leu Ile Val Asn Phe Phe Leu Val Leu Val Thr Phe

280
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Thr Glu Trp

290 300
Glu Trp Ile Arg Gly Ala Leu Val Thr Val Asp Arg Asp Tyr Gly Ile

310 320
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His

330
Leu Phe Ala Thr Ile Pro His Tyr Asn Ala Met Glu Ala Thr Glu Ala

340 350
Ile Lys Pro Ile Leu Gly Asp Tyr Tyr His Phe Asp Gly Thr Pro Trp

360
Tyr Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Leu Tyr Val Glu Pro

370 380
Asp Thr Glu Arg Gly Lys Lys Gly Val Tyr Tyr Tyr Asn Asn Lys Leu

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 387 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

10
Met Gly Gly Gly Gly Arg Met Ser Thr Val Ile Thr Ser Asn Asn Ser

20 30
Glu Lys Lys Gly Gly Ser Ser His Leu Lys Arg Ala Pro His Thr Lys

40
Pro Pro Phe Thr Leu Gly Asp Leu Lys Arg Ala Ile Pro Pro His Cys

50 60
Phe Glu Arg Ser Phe Val Arg Ser Phe Ser Tyr Val Ala Tyr Asp Val

70 80
Cys Leu Ser Phe Leu Phe Tyr Ser Ile Ala Thr Asn Phe Phe Pro Tyr

90
Ile Ser Ser Pro Leu Ser Tyr Val Ala Trp Leu Val Tyr Trp Leu Phe

100 110
Gln Gly Cys Ile Leu Thr Gly Leu Trp Val Ile Gly His Glu Cys Gly

120
His His Ala Phe Ser Glu Tyr Gln Leu Ala Asp Asp Ile Val Gly Leu

130 140
Ile Val His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser

150 160
His Arg Arg His His Ser Asn Ile Gly Ser Leu Glu Arg Asp Glu Val

170
Phe Val Pro Lys Ser Lys Ser Lys Ile Ser Trp Tyr Ser Lys Tyr Ser

180 190
Asn Asn Pro Pro Gly Arg Val Leu Thr Leu Ala Ala Thr Leu Leu Leu

200
Gly Trp Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp

210 220
Arg Phe Ala Cys His Tyr Asp Pro Tyr Gly Pro Ile Phe Ser Glu Arg

230 240
Glu Arg Leu Gln Ile Tyr Ile Ala Asp Leu Gly Ile Phe Ala Thr Thr

250
Phe Val Leu Tyr Gln Ala Thr Met Ala Lys Gly Leu Ala Trp Val Met

260 270
Arg Ile Tyr Gly Val Pro Leu Leu Ile Val Asn Cys Phe Leu Val Met

280
Ile Thr Tyr Leu Gln His Thr His Pro Ala Ile Pro Arg Tyr Gly Ser

290 300
Ser Glu Trp Asp Trp Leu Arg Gly Ala Met Val Thr Val Asp Arg Asp

310 320
Tyr Gly Val Leu Asn Lys Val Phe His Asn Ile Ala Asp Thr His Val

330
Ala His His Leu Phe Ala Thr Val Pro His Tyr His Ala Met Glu Ala

340 350
Thr Lys Ala Ile Lys Pro Ile Met Gly Glu Tyr Tyr Arg Tyr Asp Gly

360
Thr Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Lys Glu Cys Leu Phe

370 380
Val Glu Pro Asp Glu Gly Ala Pro Thr Gln Gly Val Phe Trp Tyr Arg

Asn Lys Tyr



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 383 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



10
Met Gly Ala Gly Gly Arg Met Pro Val Pro Thr Ser Ser Lys Lys Ser

20 30
Glu Thr Asp Thr Thr Lys Arg Val Pro Cys Glu Lys Pro Pro Phe Ser

40
Val Gly Asp Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser

50 60
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Ser Asp Ile Ile Ile Ala Ser

70 80
Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gln Pro

90
Leu Ser Tyr Leu Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val

100 110
Leu Thr Gly Ile Trp Val Ile Ala His Glu Cys Gly His His Ala Phe

120
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser

130 140
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His

150 160
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys

170
Gln Lys Ser Ala Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu

180 190
Gly Arg Ile Met Met Leu Thr Val Gln Phe Val Leu Gly Trp Pro Leu

200
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala Cys

210 220
His Phe Phe Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu Gln

230 240
Ile Tyr Leu Ser Asp Ala Gly Ile Leu Ala Val Cys Phe Gly Leu Tyr

250
Arg Tyr Ala Ala Ala Gln Gly Met Ala Ser Met Ile Cys Leu Tyr Gly

260 270
Val Pro Leu Leu Ile Val Asn Ala Phe Leu Val Leu Ile Thr Tyr Leu

280
Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp

290 300
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu

310 320
Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His Leu

330
Phe Ser Thr Met Pro His Tyr Asn Ala Met Glu Ala Thr Lys Ala Ile

340 350
Lys Pro Ile Leu Gly Asp Tyr Tyr Gln Phe Asp Gly Thr Pro Trp Tyr

360
Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro Asp

370 380
Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 384 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

10
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser

20 30
Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr

40
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser

50 60
Ile Pro Arg Ser Phe Ser His Leu Ile Trp Asp Ile Ile Ile Ala Ser

70 80
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro Asn Pro

90
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val

100 110
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His Ala Ala Phe

120
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser

130 140
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His

150 160
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Arg

170
Arg Ser Gln Thr Ser Ser Gly Thr Ala Ser Thr Ser Thr Thr Phe Gly

180 190
Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu Tyr

200
Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala Cys

210 220
His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu Gln

230 240
Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu Leu

250
Pro Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Leu Arg

260 270
Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr Leu

280
Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp

290 300
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu

310 320
Asn Gln Gly Phe His Asn Ile Thr Asp Thr His Glu Ala His His Leu

330
Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala Ile

340 350
Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val Val

360
Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro Asp

370 380
Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu Xaa



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 309 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:



10
Ser Leu Leu Thr Ser Phe Ser Tyr Val Val Tyr Asp Leu Ser Phe Ala

20 30
Phe Ile Phe Tyr Ile Ala Thr Thr Tyr Phe His Leu Leu Pro Gln Pro

40
Phe Ser Leu Ile Ala Trp Pro Ile Tyr Trp Val Leu Gln Gly Cys Leu

50 60
Leu Thr Arg Val Cys Gly His His Ala Phe Ser Lys Tyr Gln Trp Val

70 80
Asp Asp Val Val Gly Leu Thr Leu His Ser Thr Leu Leu Val Pro Tyr

90
Phe Ser Trp Lys Ile Ser His Arg Arg His His Ser Asn Thr Gly Ser

100 110
Leu Asp Arg Asp Glu Arg Val Lys Val Ala Trp Phe Ser Lys Tyr Leu

120
Asn Asn Pro Leu Gly Arg Ala Val Ser Leu Leu Val Thr Leu Thr Ile

130 140
Gly Trp Pro Met Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp

150 160
Ser Phe Ala Ser His Tyr His Pro Tyr Arg Val Arg Leu Leu Ile Tyr

170
Val Ser Asp Val Ala Leu Phe Ser Val Thr Tyr Ser Leu Tyr Arg Val

180 190
Ala Thr Leu Lys Gly Leu Val Trp Leu Leu Cys Val Tyr Gly Val Pro

200
Leu Leu Ile Val Asn Gly Phe Leu Val Thr Ile Thr Tyr Leu Arg Val

210 220
His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Lys Gly Ala Leu Ala Thr

230 240
Met Asp Arg Asp Tyr Gly Ile Leu Asn Lys Val Phe His His Ile Thr

250
Asp Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His

260 270
Leu Arg Val Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Asp Thr

280
Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Arg Glu Cys Leu Tyr Val

290 300
Glu Pro Asp Glu Gly Thr Ser Glu Lys Gly Val Tyr Trp Tyr Arg Asn

Lys Tyr Leu Arg Val



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 302 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

10
Phe Ser Tyr Val Val Tyr Asp Leu Thr Ile Ala Phe Cys Leu Tyr Tyr

20 30
Val Ala Thr His Tyr Phe His Leu Leu Pro Gly Pro Leu Ser Phe Arg

40
Gly Met Ala Ile Tyr Trp Ala Val Gln Gly Cys Ile Leu Thr Gly Val

50 60
Trp Val Val Ala Phe Ser Asp Tyr Gln Leu Leu Asp Asp Ile Val Gly

70 80
Leu Ile Leu His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr

90
Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu

100 110
Val Phe Val Pro Lys Val Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg

120
Val Leu Thr Leu Ala Val Thr Leu Thr Leu Gly Trp Pro Leu Tyr Leu

130 140
Ala Leu Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr

150 160
Asp Pro Tyr Gly Pro Ile Tyr Ser Val Ile Ser Asp Ala Gly Val Leu

170
Ala Val Val Tyr Gly Leu Phe Arg Leu Ala Met Ala Lys Gly Leu Ala

180 190
Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu Val Val Asn Gly Phe

200
Leu Val Leu Ile Thr Phe Leu Gln His Thr His Val Ser Glu Trp Asp

210 220
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu

230 240
Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His Leu

250
Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Val Glu Tyr

260 270
Tyr Arg Phe Asp Glu Thr Pro Phe Val Lys Ala Met Trp Arg Glu Ala

280
Arg Glu Cys Ile Tyr Val Glu Pro Asp Gln Ser Thr Glu Ser Lys Gly

290 300
Val Phe Trp Tyr Asn Asn Lys Leu Ala Met Glu Ala Thr Val



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 372 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

10
Met Gly Ala Gly Gly Arg Met Thr Glu Lys Glu Arg Glu Lys Gln Glu

20 30
Gln Leu Ala Arg Ala Thr Gly Gly Ala Ala Met Gln Arg Ser Pro Val

40
Glu Lys Pro Pro Phe Thr Leu Gly Gln Ile Lys Lys Ala Ile Pro Pro

50 60
His Cys Phe Glu Arg Ser Val Leu Lys Ser Phe Ser Tyr Val Val His

70 80
Asp Leu Val Ile Ala Ala Ala Leu Leu Tyr Phe Ala Leu Ala Ile Ile

90
Pro Ala Leu Pro Ser Pro Leu Arg Tyr Ala Ala Trp Pro Leu Tyr Trp

100 110
Ile Ala Gln Gly Ala Phe Ser Asp Tyr Ser Leu Leu Asp Asp Val Val

120
Gly Leu Val Leu His Ser Ser Leu Met Val Pro Tyr Phe Ser Trp Lys

130 140
Tyr Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp

150 160
Glu Val Phe Val Pro Lys Lys Lys Glu Ala Leu Pro Trp Tyr Thr Pro

170
Tyr Val Tyr Asn Asn Pro Val Gly Arg Val Val His Ile Val Val Gln

180 190
Leu Thr Leu Gly Trp Pro Leu Tyr Leu Ala Thr Asn Ala Ser Gly Arg

200
Pro Tyr Pro Arg Phe Ala Cys His Phe Asp Pro Tyr Gly Pro Ile Tyr

210 220
Asn Asp Arg Glu Arg Ala Gln Ile Phe Val Ser Asp Ala Gly Val Val

230 240
Ala Val Ala Phe Gly Leu Tyr Lys Leu Ala Ala Ala Phe Gly Val Trp

250
Trp Val Val Arg Val Tyr Ala Val Pro Leu Leu Ile Val Asn Ala Trp

260 270
Leu Val Leu Ile Thr Tyr Leu Gln His Thr His Pro Ser Leu Pro His

280
Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly Ala Leu Ala Thr Met

290 300
Asp Arg Asp Tyr Gly Ile Leu Asn Arg Val Phe His Asn Ile Thr Asp

310 320
Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His Ala

330
Met Glu Ala Thr Lys Ala Ile Arg Pro Ile Leu Gly Asp Tyr Tyr His

340 350
Phe Asp Pro Thr Pro Val Ala Lys Ala Thr Trp Arg Glu Ala Gly Glu

360
Cys Ile Tyr Val Glu Pro Glu Asp Arg Lys Gly Val Phe Trp Tyr Asn

370
Lys Lys Phe Xaa

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



10
Trp Val Met Ala His Asp Cys Gly His His Ala Phe Ser Asp Tyr Gln

20 30
Leu Leu Asp Asp Val Val Gly Leu Ile Leu His Ser Cys Leu Leu Val

40
Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His His Ser Asn Thr

50 60
Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys Lys Lys Ser Ser

70 80
Ile Arg Trp Tyr Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg Ile Met

90
Thr Ile Ala Val Thr Leu Ser Leu Gly Trp Pro Leu Tyr Leu Ala Phe

100 110
Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr Asp Pro

120
Tyr Gly Pro Ile Tyr Asn Asp Arg Glu Arg Ile Glu Ile Phe Ile Ser

130 140
Asp Ala Gly Val Leu Ala Val Thr Phe Gly Leu Tyr Gln Leu Ala Ile

150 160
Ala Lys Gly Leu Ala Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu

170
Val Val Asn Ser Phe Leu Val Leu Ile Thr Phe Leu Gln His Thr His

180 190
Pro Ala Leu Pro His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly

200
Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu Asn Lys Val Phe

210 220
His Asn Ile Thr Asp Thr Gln Val Ala His His Leu Phe Thr Met Pro

(2) INFORMATION FOR SEQ ID NO: 12:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 nucleotides

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GCTCTTTTGT GCGCTCATTC 20



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 nucleotides

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
Q:ggtaccaga aaacgccttg 



20 



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 nucleotides

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

TAYWSNCAYM GNMGNCAYCA 20



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 nucleotides

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

RTGRTGNGCN ACRTGNGTRT C 21 


